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Abstract 

Estimation of the parameters of an exponential distribution based on record data 
has been treated by Samaniego and Whitaker (1986) and Doostparast (2009). Re- 
cently, Doostparast and Balakrishnan (2011) obtained optimal confidence intervals as 
well as uniformly most powerful tests for one- and two-sided hypotheses concerning 
location and scale parameters based on record data from a two-parameter exponen- 
tial model. In this paper, we derive optimal statistical procedures including point 
and interval estimation as well as most powerful tests based on record data from a 
two-parameter Pareto model. For illustrative purpose, a data set on annual wages of 
a sample of production-line workers in a large industrial hrm is analyzed using the 
proposed procedures. 

Keywords and phrases: Generalized likelihood ratio test; Invariant test; Monotone like- 
lihood ratio; Shortest-width confidence interval; Two-parameter Pareto model; Uniformly 
most powerful test. 

1 Introduction 



Let Xi,X%,X3, • • • be a sequence of continuous random variables. is a lower record 
value if it is smaller than all preceding values Xi, A*2, ■ ■ ■ , -Xjfc-i and by definition, X\ is 
taken as the first lower record value. An analogous definition can be provided for upper 
record values. Such data may be represented by (R,K) := (Ri, K%, ■ ■ ■ , R m , K m ), 
where Ri is the i-th. record value meaning new minimum (or maximum) and Ki 
is the number of trials following the observation of Ri that are needed to obtain 
a new record value Throughout this paper, we denote the observed value 

of these record data by (r,k) := (ri, k\, • ■ ■ , r m , k m ). Record statistics arise naturally in 
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many practical problems and in applied fields such as athletic events (Kuper and Sterken, 
2003), Biology (Krug and Jain, 2005), catastrophic loss (Hsieh, 2004 and Pfeifer, 1997), 
climate research (Benestad, 2003), financial markets (Bradlow and Park, 2007 and de 
Haan et al, 2009), industrial application (Samaniego and Whitaker, 1986 and 1988), 
spatial patterns (Yang and Lee, 2007), and traffic analysis (Glick, 1978). Hence, finding 
optimal statistical inferential procedures based on record data becomes very important 
and useful from a data-analysis point of view. 

The rest of this article is organized as follows. In Section 2, we present briefly the 
notation to be used through out the paper and also the form of Pareto distribution to 
be studied here. In Section 3, we describe the basic form of record data to be considered 
and the corresponding likelihood function. In Section 4, we discuss the optimal point 
estimation of the Pareto parameters, while the interval estimation is handled in Section 
5. Tests of hypotheses concerning the parameters are discussed in Section 6 and finally 
a numerical example is presented in Section 7 in order to illustrate all the inferential 
procedures developed here. 

2 Some Preliminaries 



Throughout the paper, we will use the following notation: 



Exp(n, a) 


Exponential distribution with location fj, and scale a 


Gamma(n, a) 


Gamma distribution with shape n and scale a 


Par(p,a) 


Pareto distribution with scale f3 and shape a 


Xti 


Chi-square distribution with v degrees of freedom 




1007 t/l percentile of the chi-square distribution with v degrees of freedom 


(r,k) 


{ r ii k\, ■ ■ ■ , r m , k m ) 


(R,K) 


(R\,Ki, • • • , R m , K m ) 




Yli^i Ki, the time of occurrence of the m-th record 


T* 


£™i^(log^-log/3) 






T(r) 


/ °° x r ~ 1 e~ x dx, the complete gamma function 


6m 


Maximum likelihood estimator of 9 


0u 


Unbiased estimator of 6 



A random variable X is said to have a Pareto distribution, denoted by X ~ Par(j3, a), 
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if its cumulative distribution function (cdf ) is 




a 



x > f3 > 0, a > 0, 



(2.1) 



and the probability density function (pdf) is 



f(x; (3, a) = a(3 a x 



-(a+l) 



x > (3 > 0, a > 0. 



(2.2) 



For a through discussion on various properties and applications and different forms of 
Pareto distribution, one may refer to Arnold (1983) and Johnson, Kotz and Balakrishnan 
(1994). 

3 Form of Data 

As in Samaniego and Whitaker (1986) and Doostparast (2009), our starting point is a 
sequence of independent random variables Xi, X2, X3, ■ ■ ■ drawn from a fixed cdf F(-) 
and pdf /(■)■ We assume that only successive minima are observable, so that the data 
may be represented as (r, k) := (n, k±, ri, k>2, • • ■ ,r m ,k m ), where ri is the value of the 
i-th observed minimum, and ki is the number of trials required to obtain the next new 
minimum. The likelihood function associated with the sequence {r±,ki,--- ,r m ,k m } is 
given by 



where Tq = 00, k m = 1, and Ia(x) is the indicator function of the set A. 

The above described scheme is known as inverse sampling scheme. Under this scheme, 
items are presented sequentially and sampling is terminated when the m-th minimum is 
observed. In this case, the total number of items sampled is a random number, and K m 
is defined to be one for convenience. There is yet another common scheme called random 
sampling scheme that is discussed in the literature. Under this scheme, a random sample 
Y"i, ■ ■ ■ ,Y n is examined sequentially and successive minimum values are recorded. In this 
setting, we have , the number of records obtained, to be random and, given a value 
of to, we have in this case YliLi Ki = n - 

Remark Doostparast and Balakrishnan (2011) derived classical estimators for Exp(0, a)- 
model under both inverse and random sampling schemes, and also discussed associated 
cost-benefit analysis. 



rn 



L(r,k) 



-oo,rj_i)> 



(3.1) 
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4 Point Estimation 



Let us now assume that the sequence {Ri, K\, ■ ■ ■ , R m , K m = 1} is arising from Par(f3, a) 
in (12.11). Then, the likelihood function in (13.11) becomes 



L(/3,a;r,k) = ^ akt+1 , < (3 < r m , a > 0, (4.1) 



and so the log-likelihood function is 



a; r, k) = mlna - fcj(lnrj - ln/3) - ^lnrj, < /3 < r m , a > 0. (4.2) 

Since -§pl{fi, a; r, k) = a/3 -1 ^« > 0' KPi Q ! r i k) is increasing with respect to /3. This 
implies that 

hi = Rm- (4.3) 
Substituting (14. 3D in (14. 2p . the maximum likelihood estimate of a is readily obtained as 
«M = ^- (4-4) 

Furthermore, (^2™ =1 Ki In Ri,T m , R m ) is a joint sufficient statistic for (/3, a). 

Corollary 4.1 It can be shown that T* and T£ are distributed as Gamma{m,a~ l ) and 
Gamma(m — l,a ), respectively. 

From Corollarv l4.lt an unbiased estimator for a is given by 
m — 1 



au 
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In the following, we show that olm dominates ajj, under square error (SE) loss function. 
In other words, ajj is inadmissible under SE loss function. First, we need the following 
lemma. 

Lemma 4.2 Suppose X has a Gamma{u,T) distribution. Then, 

E{X- k ) = r- fe ^^, k<u. 
Proposition 4.3 For m > A, under the SE loss function, &m dominates au- 
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Proof From Lemma 14.21 we have 
MSE{&m) '■= E {&M — ct) 2 

, m 



2 2 r(m-l-2) 2 2 r(m-i-i) 
m a — — r \- a — 2a m- 



a 2 < - — + 1 



r(m — 1) r(m — 1) 



m 2 2m 



(m — 2){m — 3) m — 2 



= (4.5) 
(m-2)(m-3) v ; 

Replacing m with m — 1 in ()4.5|) . we immediately have 

MSE(a v ) = a \ ( ™ ± 5) - . (4.6) 
v uy (m-3)(m-4) v ; 

Thus, the efficiency of djw with respect to a;y is given by 

MSE(&u) 



EFF(a M ,ay) - mse{&m) 

(m + 5) (m — 2){m — 3) 
(m — 3)(m — 4) (m + 6) 

(m + 5)(m — 2) 
(m + 6)(m — 4) 

m + 14 
(m + o)(m — 4) 

> 1, 

which is the desired result. □ 



One can easily check that the bias of &m, under the SE loss function, is 

Bse(&m, ot) = E(&m) — a = 2a/(m — 2). 

It may be noted that 

lim EFF(a.M, &u) = 1- 

This is to be expected since &m and ay are equivalent for large values of m. Figure [1] 
shows the relative efficiency of olm with respect to ay. 
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Figure 1: Relative efficiency of &m with respect to &u given by (|4.7|) . 
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5 Confidence intervals 



Suppose we observe (r, k) from a two-parameter Pareto distribution in (|2.ip . In this 
section, we discuss the construction of exact confidence intervals for the two parameters 
in different cases. 



5.1 a known 

Suppose the shape parameter a is known. Then, from (|4.1|) . we have (T m ,R m ) to be a 
joint sufficient statistic for [3. Since T m is distributed free from parent distribution (Glick, 
1978), we consider two approaches for obtaining confidence intervals for (5 on the basis of 
record data. 



Unconditional method 



To obtain a confidence interval for (3, we need the following lemma due to Doostparast 
and Balakrishnan (2011). 

Lemma 5.1 Suppose we observe (r, k) from a two-parameter Exp(fi, a) -distribution. Then 



Rr 



g{x;m) 



{— ln(l — exp(— x))} 



m—l 



■ exp(— x), x > 0. 



(5.1) 



a T(m) 
Lemma 5.2 If X ^ Par(fi,a), then InX ~ Exp(lnj3,a~ 1 ). 

From Lemmas 15.11 and 15.21 it can be shown in this case that 

a QnRm — ln/3) ~ g(x; m). (5-2) 

This implies that — ln(l — exp {— a (In R m — ln/3)}) has a gamma distribution with pa- 
rameters (m,l), and therefore, 

"K 1 (5 - 3> 

Hence, an equi-tailed 100(1 — 7)% confidence interval for f3 is given by 
/ 



Iet,i(P) 



Rr, 



\ 



1 — exp 



v 2 



•,Rr, 



i 



1 — exp 



X 



2m,l-- 



(5.4) 



Suppose we restrict our attention to intervals of the form (aR m , bR m ), where < a < b. 
Since the function g(x;m) in f)5. 1 j) is decreasing with respect to x for every m > 1, the 
100(1 — 7)% confidence interval for (3 with minimum width in this subclass of intervals is 



Iml,x(P) = {Rm exp j - 9m ^ 7 j , R Tl 



(5.5) 



where g ma is 1007-th percentile of the pdf g(x; m) in (|5.1 
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Conditional method 



Since minimum of a random sample of size n from Exp(0, a) has a Exp(0, <r/n)-distribution 
[see Arnold, Balakrishnan and Nagaraja (1992)], conditional on T m = j for j > m, the 
random variable aj(\aR m — ln/3) has a standard exponential distribution. Therefore, a 
conditional equi-tailed 100(1 — 7)% confidence interval for f3 is given by 



Rrn a \l Rm °W 1 ~~ ~ 



This implies that an equi-tailed 100(1 — 7)% confidence interval for /3 is 



Iet,i,c{P) 



R r , 



(5) 



i_ 



, Rm ( 1 



The expected width of the interval in (|5.4p is 

L(lET,l(P)) = E ( R m) 





1 — exp < 






1 — exp < 




i) 




I 2 J 


h 


I 2 J 















while the expected width of the interval in (15. 6ft is 



(5.6) 



(5.7) 



= J2 E [ Rm \( 1 



2, 



2, 



r m = j ^(^m = j) 



j=m 



£ (1-? 



7 \ 

2 



E(R m \T m =j)P(T m = j). 



Again, since the minimum of a random sample of size n from Exp(0, a) has a Exp(0,a/n)- 
distribution, from Lemma 15.21 we conclude that lnR m \ (T m = j) has a Exp(ln[3, {aj)~ l ) 
distribution. So, 

E(R m \T m =j) = £(exp{lni? m }|T m = j) 



In 

ja-1' 
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Therefore, 



L(Ikt,cM) 



E (i 



7 



ja - 1 



P(T m = j) 



t . |<N _2 )a fe_ ( 2 )5 fei 



, £H(1 _ i) A_ (i) ^}_^_ 



Hence, for computing the expected width of the interval in (j5.6l) . we need the probability 
mass function of T m . From Sibuya and Nishimura (1997), we have 



P{T m = j) 



1 



J - 

m 



1 



j > m, 



(5.8) 



where brackets [ ] denote unsigned Stirling numbers of the first kind defined by the poly- 
nomial identity 



z(z + l) 



(z + n-1) 



E 

m=l 



n 
m 



Now, let H(-) be an arbitrary function. Then, Doostparast and Balakrishnan (2009) 
showed that 

T m H{Tm — 1) 



E{H(T m )) = E 



T m -i 



(5.9) 



and this formula may be used for obtaining the required expectations by taking a suit- 
able choice for the function H(-). However, no explicit expression seems possible for 
E [a aT ™ r^F ma i ) and so a simulation study was carried out to generate sequences of inde- 

pendent observations based on which the desired estimates were calculated for E (a aT <r ' ' 
in the illustrative examples. 



T m a-1 



Remark The theory of uniformly most powerful (UMP) one-sided test can be applied to 
the problem of obtaining a lower or upper bounds. In Section [61 we will obtain uniformly 
most accurate (UMA) lower and upper bounds for f3. 



5.2 f3 known 

If f3 is known, then is a complete sufficient statistic for a, and so confidence intervals 
can be based on this statistic. Since T* is distributed as Gamma(m, a -1 ), we have 

2«T , 1 *~ X ( 2m) . (5-10) 
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7 


2 


3 


rn 
4 


5 


6 


7 


0.10 


0.167630 


0.882654 


1.874590 


3.017327 


4.258219 


5.569586 




7.864292 


10.958349 


13.892227 


16.710795 


19.446252 


22.118958 


0.05 


0.084727 


0.607001 


1.425002 


2.413920 


3.516159 


4.700465 




9.530336 


12.802444 


15.896592 


18.860434 


21.728898 


24.524694 


0.01 


0.017469 


0.263963 


0.785646 


1.497847 


2.344412 


3.291176 




13.285448 


16.901320 


20.295553 


23.532765 


26.653130 


29.683220 



Table 1: Values of a (the upper figure) and b (the lower figure) in ()5.12p for 7 = 
0.01,0.05,0.1 and different choices of m. 



Therefore, in practice, one may use equi-tailed 100(1 — 7)% interval of the form 
Iet,x(<x) := 



'2 2 
^2to,7/2 ^2m,l-(7/2) 



h(a,b)= (—,— ), 0<a<6. (5.11) 



1 2T{ 2T* I 

Suppose we restrict ourselves to a class of intervals of the form 
a b 

We then need to find a and b that minimizes the width of the interval in (|5.1ip subject to 
the confidence coefficient being 1 — 7. Using Lagrange method, we then need to solve the 
following equations for a and b, determining Iml(oi), as 

h 2 m(x)dx = 1-7 and h 2m (a) = h 2m (b), (5.12) 

where h v {x) is the density function of a chi-square distribution with v degrees of freedom 
given by 

hv{x) = 2^Tx|) ^ /2 ~ lex p(-f)' x>0 - ( 5 - 13 ) 

Table [T] presents values of a and b up to six decimal places that satisfy the conditions in 
(I5TT21 . 

Suppose that the random variable X has a Gamma(v, redistribution, where v is a 
known constant. A UMP test does not exist for testing Hq : t = tq against the alternative 
Hi : r ^ To (Lehmann, 2000, p. 111). So, there are no UMA bounds for a. However, the 
acceptance region of the UMP unbiased test is 

IX 

C!< — < C 2 , 
TO 
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where C\ and C 2 are obtained from the equations 

-c 2 



f 2 h 2v (x)dx = l- 1 and C^e~ Cl/2 = C%e~ c 1 1 



This yields UMA unbiased bounds for a as 
Ci C 2 



2T?' 2T*J ' 



(5.14) 



with 

rC 2 



[ 2 h 2m {x)dx = 1 -7 and Cfe" 01 / 2 = C 2 m e - C2 / 2 . 



Corollary 5.3 E/MA unbiased and minimum width intervals in the class (|5.1ip given by 
(|5.14p and (I5.12p . respectively, are identical. 

Remark Prom Lehmann (2005, p. 72, Theorem 3.5.1) and (|5,10p . the acceptance region 
of the most powerful test of Hq : a = ao against H\ : a < ao is 2a^T^ < C u , where C u is 
determined by the equation 

Cu 

h 2m (x)dx = 1-7. 





Therefore, — ^f- — - is a UMA upper confidence bound for a. Similarly, — — is a UMA 
2Tf 2Tf 

lower confidence bound for a, where Cl is such that 
h 2m (x)dx = 1-7, 

c L 
or 

h 2m (x)dx = 7. 







That is, — — ^- is a uniformly most accurate lower confidence bound for a (without the 
2T X 

restriction of unbiasedness). For more details, one may refer to Lehmann (2005) and 
Pachares (1961) for tables of C\ and C 2 . 



5.3 (3 and a both unknown 

From (|4.ip . the statistic (T 2 * , T m , i? m ) is jointly sufficient for /3 and a. Therefore, confidence 
intervals may be developed based on this statistic. 
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Confidence interval for a 

From Lemma [521 a ~ l ls a scale parameter for data (R[ , K\ , • • • ,R' m ,K m ), where R[ = 
In Ri for 1 < i < m. Therefore, the assumption that the limits remain unchanged upon 
the addition of a constant to all log-record values (R^) seems reasonable and leads to 
intervals depending only on . For convenience, we restrict ourselves to multiples of T$ 
for intervals of the form 



2T* ' 2T* 



< a < b. 



(5.15) 



Now, since is distributed as Gamma(m — 1, a -1 ), we have 2aT^ ~ X2(m-iV we can 
use the conditions in (]5. 12|) for obtaining the minimum width confidence interval simply 
by replacing m and T* by m — 1 and T£ , respectively. 



Confidence interval for (3 

First, we need the following lemma of Doostparast and Balakrishnan (2011). 

Lemma 5.4 Suppose the random variable U is distributed with pdf g(x; m) as in f)5. 1 j) 
(to > 1) and T is a chi-square random variable with v degrees of freedom. IfU and T are 
independent, then the pdf ofW:=^- is given by 

1 /"OO 

fa( w) = wl+ , /2r(m)rW2)2 / o {-ln(l-e-)r-V/V^W (5.16) 

Since the random variable R m and T% are independent (Arnold et at, 1998) and that 
2aT^ is distributed as chi-square with 2(m — 1) degrees of freedom, by Lemma [5^1 we can 
conclude that 



lni? r . 



ln/3 



f w {w;m,2m - 2). 



(5.17) 



So, an equi-tailed 100(1 — 7)% confidence interval is given by 



Rr, 



w 1 ( / 2 ( m ^m-2) 



1 Rn 



Rr, 



A',' 



(5.18) 



where w^(m,u) is the 1007-th percentile of the density in (|5.16l) . For some choices of m 
and 7, the values of w 1 {m, 2m — 2) were obtained by Doostparast and Balakrishnan (2009) 
and these are presented in Table [2j 

Restricting to intervals of the form (aR m , bR m ), where < a < b, the values of a and b 
which minimize the width in this subclass of intervals subject to the confidence coefficient 
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m 


0.01 


0.025 


0.05 


0.95 


0.975 


0.99 


2 


0.00068871 


0.00200001 


0.00462857 


1.28171529 


1.99110929 


3.40093274 


3 


0.00005829 


0.00018945 


0.00048138 


0.19893804 


0.28399815 


0.42122379 


4 


0.00000748 


0.00002678 


0.00007396 


0.05887585 


0.08503410 


0.12545761 


5 


0.00000118 


0.00000457 


0.00001356 


0.02116642 


0.03157779 


0.04776232 


6 


0.00000021 


0.00000088 


0.00000279 


0.00829748 


0.01289715 


0.02026765 


7 


0.00000004 


0.00000018 


0.00000061 


0.00339234 


0.00551395 


0.00908049 


8 


0.00000001 


0.00000004 


0.00000014 


0.00141525 


0.00240988 


0.00415617 



Table 2: Percentiles of the density in (|5.16p for m = 2, • • • ,8 and v = 2m — 2. 



being 1 — 7 can be obtained by solving the following equations: 
f w (|;m,2m -2\= f w (^;m,2m - 2) , 

(5.19) 

fa/T* fw(x; m, 2m - 2)dx = 1-7, 
where fw(w;m,v) is as in (|5.16p . 



6 Tests of Hypotheses 

In this section, we treat tests of hypotheses concerning the two parameters of the Pareto 
distribution in (|2.ip . To this end, we consider the following three cases. 



6.1 a known 



If a is known, then {T m ,R m ) is a joint sufficient statistic for /3. Since T m is an ancillary 
statistic (Glick, 1978), R m is a partially sufficient statistic for /3. The joint pdf of (R., K) 
given by (|4.ip possesses the MLR property in R m . From Theorem 2 of Lehmann (1997, 
p. 78) and (|5.3p . the UMP test of size 7 for testing Hq : f3 < (3q against the alternative 
Hi : /3 > /3 is 

'l, Rm > A) (1 -exp{-ixi m , 7 })~ 1/a , 
<f>(r,k) = l (6.1) 

, 0, Rm< A) (1 - exp {-\xl ma }Y l/a ■ 

By interchanging inequalities throughout, one obtains in an obvious way the solution for 

the dual problem. Thus, the UMP test of size 7 for testing Hq : (3 > (3q against the 

alternative H\ : (3 < (5q is 

'l, R m < Po (1 -exp{-ixL,i-7}r lA \ 
0(r,k) = <^ (6.2) 



k 0, R m > fa (1 - exp {-|xi m> i_ 7 }) 



-1/a 
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The UMP tests in and (JHHJ) imply that the UMP test for testing H : (5 = j3 against 
the alternative Hi : p ^ Pq does not exist. 



From (|4.ip . the likelihood ratio statistic is obtained as A = (Po/R m ) m for r m > /3q 
and A = for r m < Pq. Thus, the critical region of the GLR test of level 7 is C = {(r, k) : 
aT m {\ogR m - log /3o) < c, or R m < f3 }. Since aT m (logR m - log/3 )|(T m = j) has a 
standard exponential distribution and T m is distributed free from the parent distribution 
(Glick, 1978), we have the following proposition. 

Proposition 6.1 Critical region of the GLR test of levels for testing Hq : /3 = fto against 
the alternative Hi : (5 7^ (3q is given by 



c 



(r,k): 



Rrn 



< 7 or Rm < Pq 



(6.3) 



As mentioned earlier, the theory of UMP one-sided test can be applied to the problem of 
obtaining a lower or upper bound. Thus, from (|6.ip and (|6.2p . 100(1 — 7)% UMA lower 
and upper bounds for /3 are given by 



Rr, 



1 



exp 



1 2 

2 X2m,7 



1/a 



, OO 



and 



\0,R m i^l -exp<j --x 2m ,i 
respectively. 



1 1 \ V° N 

1 2 



6.2 /3 known 

If P is known, the statistic T* is a complete sufficient statistic, and so all inference can 
be based on it. Since has a Gamma(m, a -1 )-distribution, and has MLR in — T*, the 
UMP tests of size 7 for testing Hq : a < Qo against the alternative Hi : a > ao and 
-ffo : a > ao against the alternative Hi : a < ao are 



(r,k) 



1. 
0. 



m,7' 



and 



(r,k) 



0, 



2a T* < x| 
otherwise, 

2a T x * > xim,i-7' 
otherwise, 



(6.4) 



(6.5) 
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m 


7 


1 


2 


3 


4 


5 


0.01 


0.0169 


0.0589 


0.3139 


2.2636 


20.7899 


0.02 


0.0332 


0.1136 


0.6004 


4.3280 


39.7496 


0.03 


0.0490 


0.1658 


0.8766 


6.2658 


57.6354 


0.04 


0.0647 


0.2175 


1.1370 


8.1415 


74.3884 


0.05 


0.0797 


0.2664 


1.3850 


9.9380 


90.4894 


0.1 


0.1517 


0.4918 


2.5377 


18.0175 


163.4582 



Table 3: Quantiles of X m exp{— X/2}, where X ~ xlm? f° r some choices of m and 7. 



respectively. Therefore, the UMP test for testing Hq : a = qo against the alternative 
Hi : a 7^ ceo does not exist. One can easily show that the critical region of the GLR test 
of level 7 for testing Hq : a = oq against the alternative Hi : a ^ ao is 

C=|(r,k):Z 1 m e X p|-iz 1 | <A, (6.6) 
where under Hq, Z\ := 2o.qT* ~ xim an d c* is chosen such that 
7 = Pa= ao (zT exp {-^1} < c v 

Table [3] presents simulated critical values for applying the GLR test, obtained by Doost- 
parast and Balakrishnan (2011), for some choices of m and 7. 

6.3 Unknown f3 and a 
Hypotheses tests for a 

There is no UMP test for one-sided hypotheses on the scale parameter /3. So, we restrict 
our attention to smaller classes of tests and seek UMP tests in these subclasses. 

The family of densities {Par((3, a) : (3 > 0, a > 0} remains invariant under translations 
R[ = Ri/c, < c < 00. Moreover, the hypotheses-testing problem remains invariant 
under the group of translations, that is, both families of pdfs {Par(f3,a), a > ao} and 
{Par ([3, a), a < ao} remain invariant. On the other hand, the joint sufficient statistic is 
(R m , T m ), which is transformed to (R m /c, , T m ). It follows that the class of invariant 
tests consists of tests that are functions of T£. Since T£ ~ Gamma(m — 1, a" 1 ), the pdf of 
T£ possesses the MLR property in — , and it therefore follows that a UMP test rejects 
Hq : a < ao if < c, where c is determined from the size restriction. Hence, we have the 
following proposition which presents a UMP invariant test for one-sided hypotheses. 
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Proposition 6.2 To test Hq : a < Qo against H\ : a > a®, a UMP invariant test of size 
7 is 



1, 2a T 2 * < X2m-2,7' 

(r.k)- { (6.7) 
0, otherwise, 



and to test Hq : a > ao against H\ : a < ao, a UMP invariant test of size 7 is 

2 

2m— 2,l-7> 



(r,k) 



1, 2a T* > xlr, 



(6.8) 

0, otherwise. 



There is no UMP test for testing Hq : a = oq against the alternative H\ : a 7^ ao- We 
therefore use the GLR procedure for this testing problem. The likelihood ratio function 
is given by 



rn 



A = (J^J exp{-m(Z 2 /2m - 1)}, (6.9) 

where Z2 = IolqT^ which, under Hq, is distributed as chi-square with 2(m — 1) degrees of 
freedom. Hence, the critical region of GLR test at level 7 is 

C = {(r,k):y m exp(-y/2)<a}, (6.10) 

where a is chosen such that 7 = P(Z 2 n exp{— Z2/2} < a) and Z2 ~ x|( m _iy Table [3] 
presents critical values for applying the GLR test for some choices of m and 7. 

Hypotheses tests for j3 

In the case of unknown a, finding a UMP test for one- and two-sided hypotheses remains as 

m 

an open problem. However, c\m = -F^m — ttt, ; ttt is the maximum likelihood 

22i=l Ki{\ogRi - log/3 ) 
estimator of a under Hq : f3 = /3q. This fact and (|4.1|) yield the likelihood ratio statistic, 

for testing Hq : f3 = /3 against the alternative Hi : (3 7^ (3q, as 

A=( (I)" fOT r ™^° , (6.11) 
[ for r m < j3 

where T$ = Z7U KiQogRi - log A))- But, 

2£ _ 1 _ T m (log # m - log/3 ) 
T * ma M ,o 

Therefore, the critical region of the GLR test of level 7 for testing Hq : f3 = (3q against the 
alternative H\ : (5 7^ (3q is given by 

C = {(r,k) : T m (logi? m -log/3 ) > a M ,oC* or R m < #>}, (6.12) 



16 



Table 4: Annual wage data (in multiplies of 100 U.S. dollars). 
112 154 119 108 112 156 123 103 115 107 
125 119 128 132 107 151 103 104 116 140 
108 105 158 104 119 111 101 157 112 115 



Table 5: Record data arising from annual wage data with m = 3. 
~i 1 2 3~ 
Ri 112 108 103 
Ki 3 4 1 



where C* is obtained from the size restriction 
7 = Pj3 Q (r m (log Rm - log /3 ) > a M ,oC*) . 



(6.13) 



An explicit closed-form expression for C* in (|6.12p does not seem to be possible. But, one 
can use the following expression for computational purposes: 

7 = Pg (T m (log R m - log /3 ) > a M ,oC*) 

oo 

= P /?o {T m {\ogR m - log /3 ) > a Mfi C*\T m = j) P{T m = j) 



log R m - log go 

E^i^aog^-iogA) 



> 



—I 

mj 



J - 

m 



(6.14) 



7 Numerical Example 

Dyer (1981) reported annual wage data (in multiplies of 100 U.S. dollars) of a random 
sample of 30 production- line workers in a large industrial firm, as presented in Table HI He 
determined that Pareto distribution provided an adequate fit for data. Assuming inverse 
sampling scheme with m = 3, the corresponding record data are presented in Table 03 
From (|4.3p and (]4.4p . the MLE of f3 and a on the basis of record data are obtained to 
be (3m = 103 and au = 6.804, respectively. From (|5.18p . an equi-tailed 95% confidence 
interval for /3 is obtained to be 

(90.877, 102.991) . 

Similarly, a minimum-width 95% confidence interval for a in the class (15.150 is obtained 
as 



(0.096, 10.807) 
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For testing Hq : a = 6 against the alternative H% : a ^ 6, we have Zi = IqlqT^ = 5.291, 
which gives 



From Table [3l we have a = 0.2664. Therefore, (|6.10j) implies that Hq is not rejected. 
Since we can not find C* in (|6.12p . we can not test the hypothesis Hq : j3 = /3q against the 
alternative H\ : (3 ^ (3q. In this case, one may conduct a simulation study and calculate 
the percentile of A in (|6.1ip by specifying (3q, and then carry out a likelihood-ratio test. 
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